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Abstract 

For many years it had been believed that steric compatibility of helix in- 
terfaces could be the source of the observed preference for particular angles 
between neighbouring helices as emerging from statistical analysis of protein 
databanks. Several elegant models describing how side chains on helices can 
interdigitate without steric clashes were able to account quite reasonably for 
the observed distributions. However, it was later recognized (Bowie, 1997 and 
Walther, 1998) that the "bare" measured angle distribution should be corrected 
to avoid statistical bias. Disappointingly, the rescaled distributions dramati- 
cally lost their similarity with theoretical predictions casting many doubts on 
the validity of the geometrical assumptions and models. In this report we 
elucidate a few points concerning the proper choice of the random reference 
distribution. In particular we show the existence of crucial corrections due to 
the correct implementation of the approach used to discriminate whether two 
helices are in contact or not and to measure their relative orientations. By using 
this new rescaling, the "true" packing angle preferences are well described, even 
more than with the original "bare" distribution, by regular packing models. 
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Introduction 

The issue of the pairwise packing between hehces in proteins was addressed soon after 
hehcal structures had been suggested. A number of models were developed, mostly 
devoted to surface complementarities upon packing. The "knobs into holes" model, 
first introduced by Crick [1] and elaborated by Richmond and Richards [2], aimed 
to find the best steric fit between regular helices. Chothia, Levitt and Richardson 
[3, 4] recognized the importance of "ridges" and "grooves" formed by residues with 
different sequential distances ( "ridges into grooves" model) . Efimov [5] tried to relate 
the packing angle between the two helices with the preferred rotational states of 
the side-chains along them. A comprehensive analysis was eventually carried out by 
Walther et. al. [6], by modeling helix packing as the superposition of the two regular 
lattices that result from unrolling the helix cylinders onto a plane and contain points 
representing each residue. The six "preferred" angles predicted by this last model are 
consistent with earlier results and with the histograms of the experimentally observed 
packing angles (see Fig. 7 of Ref. [6]). The agreement between theoretical modeling 
and experimental data was remarkable, although not perfect (see Ref. [6] for a more 
detailed discussion). 

The success of steric models in providing an explanation for the most prevalent 
packing angles was however put under discussion after the observation made by Bowie 
[7] that statistical corrections must be applied to the values collected from experimen- 
tally determined structures before true interaxial angle preferences can be revealed. 
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Indeed, the helix-helix packing is defined to occur only when: 

• [CI] the segment of closest approach (SCA), of length dn, between the two finite 
helix axes is shorter than a prefixed threshold dc', 

• [C2] this segment intersects both hehx axes at a perpendicular angle. 

In Ref. [6], two helices were considered to be in contact if they were satisfying 
both conditions CI and C2. Note that Walther et al. [6] selected interacting hehcal 
pairs from their database of native protein conformations, by considering distances 
between all possible inter-hehcal heavy atom pairs. We used a different selection 
procedure involving distances between inter-helical Cq atom pairs (see Methods for 
details). None of these particular choices really affects our geometrical analysis, and 
condition CI, as defined above, is just the simplest way of defining an equivalent 
distance constraint. 

Condition C2 is needed to ensure face-to-face packing of the two hehces, justifying 
thus the use of theoretical modeling based on the steric interdigitation of the helices. 

If the SCA is coincident with the global segment of closest approach (GSCA), of 
length d, between the two straight lines which are obtained by indefinitely prolonging 
the helix axes, condition C2 is then automatically satisfied. In such a situation, 
which corresponds to effectively dealing with helices of infinite length (see Fig. la), 
the reference probability distribution of interaxial angles P{^) is simply the spherical- 
polar distribution between any two random vectors, namely P{p) ~ sinil [7]. 
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Walther and co-workers [8] realized that the finiteness of hehx axes is crucial in 
modifying the angular dependence of P{fl), because requiring that the SCA intersects 
both axes at a perpendicular angle introduces new restrictions depending on the 
packing angle Q. This can be understood by looking at Fig. 2a, where it is shown 
that, fixing one helix position, and assuming the GSCA to be orthogonal to the page, 
the second finite helix axis may then be placed only within a plane parallel to the 
paper plane and such that its starting point lies inside the dark shaded parallelogram 
A. 

The probability of placing the starting point of the second helix in A is propor- 
tional to its area and therefore to sinfl. Consequently the probability P{fl) for se- 
lecting a particular packing angle is proportional to the product of the spherical-polar 
contribution and of the sin fl effect due to the finite length condition: 

P{n)^sm^n (1) 

With this new random reference distribution the actual angular propensities need 
to be reconsidered. As is clear from Fig. 2 of Ref. [8], the normalized frequencies 
reveal a prominent representation of packing angles near 0° and 180° (not expected 
by theoretical models) whereas the predicted optimal steric packing angles manifest 
themselves, at most as shoulders in the distribution of propensities. The predictivity 
of geometrical models is cast under serious doubts after this analysis. 

Our first observation is that the random distribution P{fl) ~ sin^ fl is correct only 
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and only if the condition of mutual perpendicularity between the SCA and the two 
axes is strictly fulfilled. But is this the real situation when statistical histograms are 
derived? 

As a matter of fact, condition C2 was relaxed by Walther et al, by admitting a 
small tolerance: a total deviation t = T1 + T2 (ti and T2 are the complementary angles 
to the angles 9i and 62 formed between the SCA and helix 1 and helix 2, respectively 
(see Fig. lb)) was accepted up to a threshold Tmax (Tmax — 5° in [6] and Tmax = 1° in 
[8]). 

At first, one might wonder why such a threshold needs to be used since it does not 
efi^ectively increase the number of data contributing to the histograms. However, we 
believe that this choice is indeed necessary because of the ambiguity in the definition 
of the axis direction in natural helices, which introduces an intrinsic uncertainty in the 
computation of the 9, r angles (see Methods for a detailed explanation of how the 
axis is reconstructed in the typical case of a non-ideal bent helix and for an estimation 
of such uncertainty Ail). A relaxation of condition C2 is thus crucial, if we want to 
analyse correctly data extracted from the database of real native protein structures. 

We will show in this paper, by means of partly semi-analytical geometrical argu- 
ments and eventually numerical simulations, that such threshold effect does drasti- 
cally change the random reference distribution. Having measured helix-helix packing 
angles from a set of 600 proteins representative of the PDB native structures, we will 
then reanalyse the packing angle distribution after its proper rescaling with the newly 
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found reference distribution. 

Methods 

Databank and helix pair selection 

We employed the same ensemble of 600 proteins considered by Chang et. al [9], which 
consisted of sequences varying in length from 44 to 1017, with low sequence homol- 
ogy and covering many different three-dimensional folds according to the Structural 
Classification of Proteins (SCOP) scheme [10]. The structures were monomeric and 
determined using x-ray crystallography. We collected 4397 helices each with at least 
four consecutive residues classified as hehcal in the PDB files. The average number 
of residues of these helices was 11.5. Two hehces were defined to be in close contact, 
if at least one interhelical contact between C" atoms was present, with a maximal 
threshold distance of 5. 5 A (analogous to condition CI in the text). Only hehx pairs 
separated in sequence by at least 40 intervening residues were considered, to get rid 
of possible correlations induced by short loops. The resulting data set consisted of 
1460 closely packed helix pairs. 

Helix axis reconstruction 

The reconstruction of the helix axis from the coordinates of the atoms of the 
corresponding residues is a critical step in the determination of packing angle prefer- 



7 



ences. Since real helices can be bent, we adopted the procedure described by Walther 
et al. in ref. [6], in which a local axis is associated to every consecutive residue pair 
along the helix. The overall axis is thus a broken hue consisting of short segments. 

A good starting approximation for the local axis a^, based on the the four 
atom positions rj_i, rj, rj+i, rj+2, was introduced by Chothia et al. [4]. We em- 
ployed a sligthly modified definition, where we first define the normalized bond vec- 
tors hi = (rj+i — Ti) I |rj+i — Vi\. In this way, the set of three orthonormal vec- 
tors tj, Vj, Uj, the natural reference system associated with the C" atom trace, can 
be defined without the distortions due to the fluctuations of the bond length be- 
tween consecutive atoms in the following way [11]: tj = (bj -|- bj_i) / |bj + bi_i|, 
Vj = bj X bi_i/ |bi X bi_i|, Ui = ti X v,. If the C" atom trace followed a perfect ideal 
helix, the vector = Uj x u^+i would be parallel to the helix axis. We then initially 
determined the local axis between residues i, i -|- 1, as the vector parallel to aj, of 
length 1.45 A, which has the geometric centre of the closest four consecutive atom 
positions, rj_i, r^, rj+i, rj_|_2, as its midpoint. The two local axes at the helix termini 
were obtained by simply prolonging the neighbouring ones. 

We then applied a smoothing procedure similarly to Walther et al. [6]. The 
'new' local axes were obtained by averaging the direction of the closest three 'old' 
ones, a^_i, a^, a^+i, while conserving its midpoints. For the two local axes at the helix 
termini we averaged over the direction of the closest two available ones. To ensure the 
continuity of the overall axis, the inner hinges of the broken axial line were computed 
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as the midpoints between extremities of consecutive local vectors obtained from the 
running average. After repeating the whole procedure twice, the standard deviation 
of the distances of each C" atom from the reconstructed helix axis was 0.16A. 

The interaxial packing angle between two helices was computed bewteen the local 
axis pair for which the minimum distance of closest approach was achieved. In case 
the segment of closest approach (SCA) intersected an inner hinge of the broken global 
axis, the local axis direction was defined as the average of the two corresponding local 
vectors. The whole discussion concerning the perpendicularity of both local axis 
direction with the SCA applies only to cases involving a terminal local axis, since 
face-to-face packing is anyway ensured in case of contact between inner local axes. 
Packing angles are positive if the background helix is rotated clockwise with respect 
to the frontal helix when facing them. The angle fl — 0° {fl — ±180°) corresponds 
to parallel (antiparallel) helices with respect to their sequence direction. 

Imposing the further requirement that the SCA intersected both local axis di- 
rections at a perpendicular angle within a threshold Tmax plays a critical role, as 
discussed in the text. For the three different Tmax values considered in this work, 
Tmax = 7°, 11°, 15°, we collected datasets of 765, 837, 899 closely packed helix pairs, 
respectively. 

As a final remark we report the mean value of the angle AQ, which is formed 
between consecutive local directions averaged over all inner hinges of all helical axes 
which were reconstructed using the procedure discussed above. We found AQ = 
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6.7° ± 4.0°, strongly supporting the necessity of allowing a similar threshold when 
imposing angular constraints as in condition C2. 

Results and Discussion 

Geometrical analysis 

We will now sketch the geometrical consequences of relaxing condition C2 within 
a given threshold, in order to understand by means of a simple argument how this 
effects the random reference distribution. A comprehensive analytical treatment is 
in principle feasible but quite cumbersome. Numerical simulation will ultimately be 
the preferential approach to extract the corrected random reference distribution for 
interaxial angles. 

Within the admitted threshold Tmax, cases similar to the one described in Fig. 
lb may now occur. The SCA between the two helices, which intersects helix 1 at 
one of its axis ends and helix 2 at some internal point, forms an angle 9i which 
deviates from | by less than Tmax, whereas the GSCA does not intersect both hehces 
(in this situation ti 7^ 0, T2 = 0). li d < dc, the two helices are considered to be in 
contact. The condition ti < Tmax is equivalent to the condition that GSCA intersects 
helix 1 within a distance AZmax from the end of its axis. This limit is reached when 

a a — E T 

Cl C'min — 2 'max- 

There are also cases in which ti — and T2 7^ 0, when the SCA intersects hehx 
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2 at one of its axis ends and helix 1 at some internal point, and cases in which both 
Ti and T2 are not zero, and the SCA intersects both helices at one of their axes ends 
(Fig. Ic). In this last case, the geometrical condition implied by ti + T2 < Tmax, and 
involving the distances Ali, AI2 from the end of both helix axes to their intersection 
with the GSCA, is more complicated (and is discussed in the caption of Fig. 1). 

Using the visual representation of Fig. 2b, we can say that allowing condition C2 
to be satisfied within the threshold t = T1 + T2 < Tmax, opens up the possibility for the 
starting point of helix 2 to lie in the portion of space (Q) formed by the four lightly 
shaded parallelograms (two parallel to helix 1 with sides hi and A = AZmaxSinQ, 
and two parallel to helix 2 with sides /i2 and A) and by the four remaining white 
corner regions ( whose boundaries are defined by hyperboles, see Figure 2 caption for 
details), which surround parallelogram A. 

To compute the area of Q is not trivial, because of the hyperbole-shaped regions, 
but the result is obviously Q dependent. For example a simple trigonometric calcu- 
lation shows that: 



A= <^ 



IsinQl > sinT, 



■\/ sin^ H— sin^ Tmax 



max 



(2) 

00 IsinQl < sinTma-if 



This implies that the area of Q diverges for |sin f2| < sin Tmax, which already points 
towards the fact that relevant corrections are possible even for small values of Tmax- 

To conclude our analysis we need to enforce also condition CI. By using the 
Pythagorean theorem it is easy to see that for any fixed d < dc, CI is satisfied for 
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any point (thought as starting point of hehx 2) whose distance As{dR) — ydj^ — d? 
from parallelogram A is less than F = As{dc) — \Jd1 — d^. These points belong to 
the region Q' , shown in Fig. 2c, formed by the 4 parallelograms (two of sides hi and 
F and two of sides /i2 and F) and the four circular sectors (of radius F) surrounding 
A. The area of Q' is independent on Vt (see caption of Fig. 2c). 

All the points in the portion of space H — Q HQ' resulting from the intersection 
of Q and Q' satisfy both condition CI and C2. 

Therefore, the probability P'^'^^^^d) of selecting a particular angle with a toler- 
ance Tmax, at a given distance d < dc, is given by the product of the spherical polar 
term sin fl [7] and a term proportional to the area of parallelogram A plus the area 

of n. 

When I sinO| < sinTmax, Eq.2 shows that A diverges and thus: 

P^— (d) ~sinQ(/ii/i2sinQ + 2(/ii + /i2)r + S) (3) 

where B is the area of the corner regions in H. 

Since F does not depend on fl, the second term in the bracket will eventually 
dominate the small |sinO| behaviour of P^™''^((i). In practice, the actual relevance 
of this effect can be appreciated only after integrating P'^'"'^(d) over d, since the 
relative weights of the different terms in the bracket vary with d. Such computation 
is quite cumbersome and we have rather chosen to get P'^'°^'' by simulating random 
helices which satisfy the contact conditions within a threshold Tmax, and to extract 
numerically the normalized histograms. 
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Numerical simulations 

In order to compute the reference distribution P'^^^ (Q) for inter axial packing angles 
we generated random helix pairs by means of computer simulations, and then selected 
them with the same conditions, Cl and C2, used in extracting histograms from 
real helices in native protein structures. Note that when computing the reference 
distribution we did not take steric effects into account; in other words random helices 
might overlap. 

More specifically, we constructed ideal discretized helices with the same geometri- 
cal properties of a-helices in real proteins, i.e. twist per residue 99.1°, rise per residue 
I.45A, radius 2.3^ (as is the case when considering Cq atoms). We chose to generate 
helices consisting of 11 residues, the average length in the dataset of real helices that 
we collected from the PDB. Keeping fixed the position of the first helix, the second 
helix was placed by firstly choosing randomly the midpoint of its axis within a sphere 
of radius ISA centred in the midpoint of the first helix axis, and then selecting, again 
randomly, both the direction of its axis and the twist of its first residue. 

Boundary effects might be relevant, when the radius of the sphere in which the 
second helix is generated is too small with respect to the helix length and helix 
finiteness is effectively reduced. We made sure that our results did not change when 
increasing the radius of the sphere in which the second helix axis is placed. 

We generated 5 • 10^ random hehx pairs, 29915472 out of which satisfied condition 
Cl of having at least one pair of residues distant less then 5.5A. Condition C2 was 
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then applied, in the same way as explained in Methods for the real helices data set, 
with the whole axis being now a segment in the ideal case. In this way we generated 
11467456, 13033815, 14553626 helix pairs, respectively, for the three different Tmax 
values considered in this work, Tmax = 7°, 11°, 15°. 

In Fig. 3 we plot the corresponding random reference distributions P^™''^ 
comparing them with the ideal (Tmax = 0) case: P{0,) ~ sin^ Q. The difference, 
although due to a very subtle effect is substantial, and it is clearly seen that a new 
regime is present for | sinJlj < sin Tmax, as expected from the previous discussion. 

Reanalysing experimental results 

To test the effect of this new reference distribution we have recomputed an experimen- 
tal distribution of interaxial angles (see Methods for details) by analysing a databank 
of 600 proteins and using a contact threshold dc = 5.5A and a tolerance Tmax = 7°, 
corresponding the intrinsic uncertainty AQ — 6.7° estimated when determining local 
axis directions (see Methods). The histogram is reported in the upper panel of Fig. 
4: in order to obtain good statistics for each single bin, we gathered results every 15°. 
The histogram is consistent with previous analysis [6, 7, 8]. In the middle panel of 
Fig. 4 we plot P^°(r2) and, for reference, P{^1) normalized with the same binning as 
in the previous panel. Finally, in the lower panel of Fig. 4 we present the histogram 
both rescaled with P^°(0) and with P{fl). The results are clear and striking: the 
correct distribution P'^°{n) removes the spurious peaks at 0° and ±180°, whereas 4 
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out of the 6 predicted packing angles [6] arise as clean local maxima in the expected 
positions. Only two of the predicted packing angles are clearly not favoured accord- 
ing to our analysis. Remarkably, the two peaks theoretically predicted at 143.9° and 
— 158°, that were seen as shoulders in the unrescaled histogram, are instead placed 
in the correct position (within binning uncertainty) in the rescaled histogram. We 
also note that a residual preference for parallel and antiparallel alignment of two 
contacting helices cannot be ruled out even after proper rescaling. Because of small 
statistics, it is difficult to ascertain whether this is a genuine effect or is just due to 
the tail superposition of different preferential angle peaks. In order to further test 
both the robustness of the results and the full procedure we have obtained statistical 
histograms from the experimental data using three different thresholds, Tmax- Since 
we can compare them with the corresponding random distribution we should expect 
three similar rescaled histograms. This is nicely confirmed by Fig. 5, where the three 
rescaled histograms show a very good overlap, within the statistical uncertainty. We 
also note that generally, the greater the threshold used to relax condition C2, the 
more blurred the peaks corresponding to the sterically preferred angles. This con- 
firms that condition C2 is needed to ensure face-to-face packing and the applicability 
of steric models. 
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Conclusion 

In this paper we have shown that the calculation of the probability distribution of 
interaxial angles between random finite helices which are in contact is not a trivial 
geometric problem, because of the approximations introduced to ensure face-to-face 
packing between contacting helices. Such approximations are unavoidable, due to 
the imperfect shape of natural occurring helices which do not have well defined axes. 
Although analytical results can be found to estimate the correct random distribu- 
tion, the simplest way to obtain it consists in using numerical simulations. We have 
presented a re-analysis of the distribution of packing angles rescaled with our new 
reference distribution and we have found a remarkable agreement with the packing 
angles predicted by steric models [6]. 
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Figure Legends 



Figure 1 

a) The global segment of closest approach (GSCA), of length d, between two straight 
lines is, by definition, perpendicular to both of them. The segment of closest approach 
(SCA), of length dr, between two helices (schematically represented by cylinders) 
surely coincides with the global one if it intersects both helices within their axis 
length. This fact is always guaranteed if the two helices are assumed to have infinite 
length (hypothesis of Ref. [7]). 

b) The two finite helices have a SCA that is not coincident with the GCSA. The 
latter intersects helix 1 at a distance Ali from its end and may (as in the picture) or 
may not intersect helix 2. The former joins the end point of helix 1 to a point inside 
helix 2. The angle formed with helix 1 (^i) is deviating from 7r/2 by an amount Ti. 
Instead O2 — j and T2 — 0. The trigonometric relation expressing Ali as an increasing 
function of Ti can be easily obtained: A/i(t) = dsmn When ti < Tmax, 

sinQY sin^ f2— sin^ n 

condition C2, defined in the text, is still fulfilled. This is equivalent to require that 
All < A/max = A/i(Tniax)- Noticc that A/i(ri) diverges when ti approaches fl from 
below, and consequently, when |sinr2| < sin Tmax, AZmax diverges. In such regime the 
condition C2 (but obviously not CI because dr may also diverge) is always satisfied 
within the allowed tolerance. A similar situation and a similar analysis can be done 
for Ti — and T2 7^ 0. 
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c) GSCA is neither intersecting helix 1 nor hehx 2 and the SCA is joining two 
endpoints. In the case depicted in the figure the two hehces are placed in opposite sides 
with respect to the GSCA. The threshold condition T1+T2 < Tmax may be translated 
into the following one involving Ali and AI2: ((AZi + AI2 cosQ)(AZ2 + AZi cosQ) — 
^{d^ + All sin^ Q) {d? + A/2 sin^ n)) / {{Ak + A/2 cosQ)^d^ + Alj sin^ Q + (A/2 + 
All cos + A/2 sin^ Q) > cot Tmax- Situations in which the two hehces are placed 

on the same side with respect to the GSCA are also possible. In this case the condition 
becomes ((A/i - A/2 cos(])(A/2 - Ah cosQ) - sj{d^ + All sin^ ^) {d"^ + All sin^ ^) ) / 
((A/i - A/2 cos n)^Jd'^ + All sin^ ^ + (A/2 - A/i cos n)^Jd'^ + A/| sin^ Q) > cotr^ax- 

Figure 2 

a) In this figure, the position of hehx 1 (represented by a vector of length hi) and 
a vector orthogonal to the paper plane, determining the direction of the GSCA to 
a second helix, are fixed. Given a particular packing angle Q, in order to satisfy 
condition C2 without any tolerance Tmax, a second finite helix axis (of length 
may then be placed only within a plane parallel to the paper plane and such that its 
starting point is inside the dark shaded parallelogram A, Otherwise the SCA between 
the two finite axes would no longer be perpendicular to both of them. The area of 
parallelogram A is /ii/i2 sin(Q). 

b) The tolerance Tmax with which the condition of perpendicularity between the 
SCA and the two helices is implemented enlarges the portion of space where the 
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starting point of helix 2 can be placed. The region Q formed by the four lightly 
shaded parallelograms and the four white corner regions, which together surround 
parallelogram A, can now be exploited. The two parallelograms parallel to helix 
2, whose sides are /i2 and A = AZmaxSinQ, are due to the geometric arrangement 
described in Fig. lb with ri 7^ and T2 = (the vector with the dashed starting 
point represent the possible location of hehx 2 in such context). The two other 
parallelograms, parallel to helix 1, with sides hi and A, correspond to the similar 
case Ti — and T2 7^ 0. The four remaining white regions, corresponding to the 
case Ti 7^ and T2 7^ 0, are all limited by hyperbola-like curves. The two of them 
subtending an obtuse angle are due to the geometric arrangement described in Fig. 
Ic, corresponding to the two helices being placed in opposite sides with respect to 
the GSCA (the vector with the white starting point represent the possible location 
of helix 2 in such context). The two of them subtending an acute angle correspond 
to the two helices being placed in the same side with respect to the GSCA. In both 
cases it is possible to find the equation describing such boundary curves, which we 
omit here for the sake of simphcity. The area of G is 2{hi + /i2)A plus a non trivial 
contribution coming from the four corner regions. 

c) The distance constraint CI limits by itself the portion of plane surrounding 
parallelogram A where the starting point of the second hehx can be placed. This 
region Q' is simply the locus of points whose Euclidean distance from A in the paper 
plane is less than F = Jdl — dP. For given values of d and the distance threshold dc-, 
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this region is thus formed by the four hghtly shaded parallelograms and by the four 
white circular sectors. The area oig is 2(/ii + /i2)r + 7rr2. 

Figure 3 

Ideal 'random' distribution of interaxial packing angles for four different values of 
the angle threshold Tmax used in applying condition C2; Tmax = 0° (upper left panel), 
Tmax = 7° (upper right panel), Tmax = 11° (lower left panel), Tmax = 15° (lower right 
panel). The four distributions were computed by means of numerical simulations 
described in Methods. Since ideal hehces do not have a preferential order, the his- 
tograms show in this figure have been restricted to the —90° < fl < 90° region. The 
14553626 packing angle values collected in the Tmax = 15° histogram were then succes- 
sively filtered out by the more and more restrictive Tmax = 11°, ^Vnax = 7°, Tmax = 0° 
threshold conditions to generate the 13033815, 11467456, 8646994 data, respectively, 
collected in the corresponding histograms. The dashed fine is the fit of the Tmax = 0° 
data, obtained enforcing condition C2 in a strict way, to the expected P{fl) ~ sin^ fl 
distribution, which is then reported for comparison in all other histograms. All his- 
tograms were constructed with a bin width of 1.5°. Histograms are normalized in 
such a way that a flat distribution would correspond to a constant 1 height. 
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Figure 4 

Upper Panel. Experimental 'bare' unrescaled distribution of interaxial packing angles 
between contacting helices. Data were obtained with a distance threshold d — 5.5A 
for inter helical Cq, atom pairs and an angular threshold Tmax = 7° for condition C2. 
The histogram is normalized in such a way that a flat distribution would correspond 
to a constant 1 height. 

Middle Panel. Ideal 'random' distribution which have to be used to rescale the 
'bare' experimental distribution, in order to reveal true packing angle preferences. The 
two different distributions obtained with either Tmax = 0° (thick line) or Tmax — 7° 
(filled columns) are shown. Histograms are normalized in such a way that a flat 
distribution would correspond to a constant 1 height. 

Lower Panel. Rescaled distribution of interaxial packing angles. The experimental 
'bare' distribution in the upper panel is divided by the ideal 'random' one in the 
middle panel, obtained with either Tmax = 0° (blue line) or Tmax = 7° (filled columns). 
Histogram heights greater than 1 correspond to preferential packing angles, whereas 
heights lower than 1 correspond to disfavoured packing angles. 

All histograms were constructed with a bin width of 15°. Arrows in the upper and 
lower panels mark the values of the six preferred packing angles predicted by Walther 
et al. [6]; r2abc = —37.1°, —97.4°, 22.0°, each represented twice with a periodicity of 
180°. 
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Figure 5 

Rescaled distribution of interaxial packing angles between contacting helices for three 
different values of the angle threshold Tmax used in enforcing condition C2; Tmax = ^° 
(hght grey filled columns), Tmax = 11° (dark grey line), Tmax = 15° (black hne). Each 
histogram was obtained by dividing the experimental unrescaled distribution by the 
corresponding ideal reference one (we used the three distributions represented in Fig. 
3 with a different binning). All histograms were constructed with a bin width of 15°. 
Note that the histograms are not normalized, being computed as the ratio of two 
normalized histograms. The arrows mark the values of the the six optimal packing 
angles predicted by Walther et al. [6] . 
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